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MODEM PASS-THROUGH PANACEA 
FOR VOICE GATEWAYS 



5 BACKGROUND OF THE INVENTION 

1. FIELD OF THE INVENTION 

[1001] The present invention generally relates to communications over packet networks, 
such as Internet Protocol ("IP") and, more particularly, to connecting communication devices 
over IP, such as connecting modem devices over IP. 

10 2. RELATED ART 

[1002] Today the traditional analog telephony is being rapidly replaced with digital IP 
telephony, which use various techniques to break voice into data packets for transmission 
over packet networks. Analog modems have been utilized to provide data, facsimile and 
voice communications over twisted pair telephone lines for decades. Modem is an analog-to- 

15 digital and digital-to-analog converter, which is capable of adapting a terminal or computer to 
an analog telephone line by converting digital pulses to audio frequencies and vice versa. 
Because a significant infrastructure is in place using such modems, the packet networks need 
to address and support modem communication. 

[1003] FIG. 1 illustrates a block diagram of a conventional communications network 100 
20 utilizing modems for communication over a packet network protocol, such as Internet 
Protocol, which may also be referred to as Modem over Internet Protocol ("Mo IP 55 ). As 
shown, communications network 100 includes first communication device 102 in 
communication with first gateway device 104 and second gateway device 114 in 
communication with second communication device 122. Communications network 100 
25 further includes a packet network protocol, such as IP 112 to provide communications 
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between first gateway device 104 and second gateway device 114. IP 112 implements the 
network layer (layer 3) of a network protocol, which contains a network address and is used 
to route a message to a different network or subnetwork. IP 112 accepts packets from the 
layer 4 transport protocol, such as Transmission Control Protocol ("TCP") or User Data 
Protocol ("UDP"), and adds its own header and delivers the data to the layer 2 data link 
protocol. TCP provides transport functions, which ensures that the total amount of bytes sent 
is received correctly at the other end. UDP, which is part of the TCP/IP suite, is an alternate 
transport that does not guarantee delivery. UDP is widely used for real-time voice and video 
transmissions where erroneous packets are not retransmitted. 

[1004] Devices 102, 104, 114 and 122 may include modems (modulator-demodulator). 
Modems may support a variety of data modulation standards, such as ITU (International 
Telecommunications Union) standards: V.22bis, V.34, V.90 or V.92, etc. Typically, modems 
have built-in error correction, such as MNP2-4 or LAPM (or V.42) and data compression, 
such as MNP5, V.42bis or V.44. Modems are also capable of supporting various voice and 
facsimile standards. 

[1005] The communication process begins when first communication device 102, e.g. 
first modem ("Ml"), originates a call to establish communications with second 
communication device 122, e.g. second modem ("M2"). First gateway device 104 ("Gl") 
receives the call and informs second gateway device 114 ( M G2 M ) of the call for M2 over 
communications network 100 and, as a result, G2 calls M2. 

[1006] Typically, in the default mode of operation, Gl and G2 communicate in voice 
mode and use compressed voice protocol, such as the ITU standard G.723. 1 . Conventionally, 
after M2 receives the call from G2, M2 answers the call and starts sending certain defined 
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initiating signals, such as an answer tone, which is typically transmitted at about 2100 Hz 
frequency. At this point, G2 starts confirming the answer tone for a pre-defined period of 
time, e.g. 50 ms to 1000 ms. Once G2 confirms the answer tone, G2 informs Gl that the 
present communication session is a modem or facsimile session. 
5 [1007] However, once G2 detects the answer tone, e.g., 2100 Hz tone, from M2, then G2 
and Gl switch to an uncompressed voice protocol, such as an ITU standard G.711, which 
provides toll quality audio at 64 Kbps using either A-Law or mu-Law pulse code modulation 
methods. This uncompressed digital format is used in order to allow easy connections to 
legacy telephone networks. By switching to G.711, the signals generated by M2 may 

10 propagate through from G2 to Gl in a more intact manner in order to reach the first modem 
at the other side, which is known as modem pass-through mode of MoIP. 
[1008] The above-described modem pass-through approach, however, has many 
drawbacks and disadvantages. For example, because there are many different types of data 
terminals (i.e. modem terminals, facsimile terminals, etc.) that must be detected by the 

1 5 gateways, not only accommodating the detection of many types of data terminals increases 
the complexity of a gateway device, but also there may always be certain data terminals that 
are not foreseen and cannot be detected by the gateway device. For instance, not all modems 
may transmit an answer tone upon answering the call, and some modems that do transmit an 
answer tone may not transmit the answer tone at 2100 Hz. Further, certain proprietary point- 

20 of-sale (POS) modems, such as VeriFone or Hypercom POS terminals may implement 
proprietary training methods that transmit other signals upon answering the call. Also, some 
proprietary training methods may shorten the answer tone duration to reduce the training 
time, such that the gateway device may not be able to properly detect the answer tone. 
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Therefore, there are many instances in which the gateway device may not detect the data 
terminal properly and, as a result, the gateway device does not configure itself to 
accommodate the data terminal by, for example, switching from a low quality voice coder to 
a high bandwidth voice coder, such as G.711, freezing the jitter buffer, adjusting the jitter 
5 buffer size, etc. The U.S. Patent Application Serial No. 09/553,773, filed April 21, 2000, 
entitled "Methods and Apparatus for Data Communication on Packet Networks" describes 
gateway configuration for accommodating modem communication over the packet network, 
which is hereby incorporated by reference in its entirety. In the event that the gateway device 
fails to configure itself to accommodate end-to-end modem training, the modem training may 

10 fail or the modems may connect at lower speeds due to undesirable line conditions caused by 
a low bandwidth coder, such as G.723.1 coder. Further, even if a modem connection is 
established utilizing a low bandwidth coder, delays and data loss may be experienced. 
[1009] Another problematic area that has been extremely challenging with the 
conventional approach is finding a solution to accommodate or connect TTY (Text 

15 Telephone) modems over a packet network. TTY modems are also sometimes referred to as 
a TDD (Telecommunication Device for the Deaf) modems, which are special devices that 
allow deaf, hard of hearing, or speech- impaired people to use the telephone to communicate. 
Telephone communication for such users is accomplished by allowing them to type messages 
back and forth to one another instead of talking and listening. A TTY modem is required at 

20 both ends of the conversation in order to establish a telephone communication. However, 
TTY modems do not use the same communication protocol as regular data modems and, 
thus, TTY modems require the gateways to perform a different detection algorithm for 
detecting TTY modems in order to configure the gateways properly. Even worse, there are 
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several different TTY modems, such as Baudot @ 45.45 baud (U.S. TTYs); Baudot @ 50 
baud (international Baudot used in Australia, England, and other countries); V.21/text 
telephone version (Sweden, Norway, and Finland); DTMF( Denmark, Holland, other 
countries); and EDT ("European Deaf Telephone", Germany, Austria, Switzerland, and other 
5 countries, which are the focus of a unifying standard called ITU V. 1 8. 

[1010] Therefore, the conventional approach for supporting modems over packet 
networks requires development and testing of numerous modem detection algorithms to 
detect various data modem protocols, facsimile modem protocols, TTY modem protocols, 
etc. Even yet, the gateway devices may not be able to properly detect all modems because of 
10 proprietary protocols used by some modem manufacturers. Further, the development and 
testing of the algorithms to detect numerous modems are quite time consuming, let alone the 
sustaining efforts, the memory space consumed by the algorithms, possibility of a false 
detection that can result in call termination or low speed connection, the added execution 
time, and so on. 

15 [1011] Accordingly, there is an intense need for an efficient cure-all approach or a 
panacea to modem pass-through for connections over packet networks, where all types of 
modems can be accommodated, and any possibility of call termination or low speed 
connections due to improper gateway configuration can be avoided. 
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SUMMARY OF THE INVENTION 
[1012] In accordance with the purpose of the present invention as broadly described 
herein, there are provided methods, systems and devices for use by a first gateway to provide 
a communicating path to a user placing a call on a communication line and through a packet 
5 network to a second gateway. The first gateway has a plurality of modes of operation 
including a data mode and a voice mode, wherein the first gateway is configured differently 
for each of the modes of operation. In one aspect, a communication method for use by the 
first gateway comprises the steps of configuring the first gateway to the data mode of 
operation; receiving a call from the user over the communication line; enabling the first 

10 gateway to detect human voice and/or silence on the communication line; maintaining the 
first gateway configured according to the configuring step in the data mode of operation if the 
first gateway does not detect human voice or silence on the communication line; and 
reconfiguring the first gateway to the voice mode if the first gateway detects human voice or 
silence on the communication line. 

15 [1013] In a further aspect, the method further comprises the step of informing the second 
gateway over the packet network of the mode of operation of the first gateway after the steps 
of maintaining and reconfiguring. Also, in one aspect, the step of maintaining occurs if the 
first gateway does not detect human voice or silence on the communication line for a 
predetermined period of time. In another aspect, the method further comprises the step of 

20 informing the second gateway over the packet network of the mode of operation of the first 
gateway if the first gateway detects human voice or silence on the communication. 
[1014] In one aspect, the data mode is a modem mode and the user is a modem device, a 
fax device, or a TTY modem. Further, in the data mode the first gateway uses a voice coder 
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with higher bandwidth than in the voice mode. For example, in the data mode the first 
gateway uses a G.711 voice coder and in the voice mode the first gateway uses a G.723.1 
voice coder. 

[1015] In yet another aspect, the first gateway has a jitter buffer, wherein the jitter buffer 
5 is larger in the voice mode than in the data mode. Also, the jitter buffer is frozen in the data 
mode and is dynamic in the voice mode. 

[1016] According to other aspects, systems, devices and computer software products or 
media for use by the first gateway in accordance with the above techniques are provided. 
[1017] These and other aspects of the present invention will become apparent with 
10 further reference to the drawings and specification, which follow. It is intended that all such 
additional systems, methods, features and advantages be included within this description, be 
within the scope of the present invention, and be protected by the accompanying claims. 
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BRIEF DESCRIPTION OF DRAWINGS 
[1018] The features and advantages of the present invention will become more readily 
apparent to those ordinarily skilled in the art after reviewing the following detailed 
description and accompanying drawings, wherein: 
5 [1019] FIG. 1 illustrates a block diagram of a conventional communications network 
utilizing modems for communication over a packet network protocol; 

[1020] FIG. 2 illustrates a block diagram of a gateway device according to one 
embodiment of the present invention; and 

[1021] FIG 3. Illustrates a flow diagram of an algorithm for implementation by the 
1 0 gateway device of FIG. 2. 



030026/US 



Attorney Docket No. : 0 1 60 1 09 

DESCRIPTION OF EXEMPLARY EMBODIMENTS 
[1022] The present invention may be described herein in terms of functional block 
components and various processing steps. It should be appreciated that such functional 
blocks may be realized by any number of hardware components and/or software components 
configured to perform the specified functions. For example, the present invention may 
employ various integrated circuit components, e.g., memory elements, digital signal 
processing elements, logic elements, and the like, which may carry out a variety of functions 
under the control of one or more microprocessors or other control devices. Further, it should 
be noted that the present invention may employ any number of conventional techniques for 
data transmission, signaling, signal processing and conditioning, tone generation and 
detection and the like. Such general techniques that may be known to those skilled in the art 
are not described in detail herein. 

[1023] It should be appreciated that the particular implementations shown and described 
herein are merely exemplary and are not intended to limit the scope of the present invention 
in any way. For example, although the present invention is described using a modem over IP 
network, it should be noted that the present invention may be implemented in other 
communications networks and is not limited to modem over IP. 

[1024] FIG. 2 illustrates a block diagram of gateway device (G2) 214 according to one 
embodiment of the present invention. G2 214 has the same location as G2 114 in 
communications network 100. In one embodiment, a process begins at step 302 of FIG. 3, 
and at step 304, G2 214 receives a call request over a packet network, e.g. IP 1 12, from Gl 
104. As explained above, the call process initiated when a user, such as a human, modem 
device, fax device, TTY device, etc. places a call to Gl 104, which in turn transmits a call 
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request to G2 214. Prior to or upon receiving the call request, at step 306, configuration 
module 210 configures various components of G2 214 for a data mode based on the 
assumption that the call is a data call, such as a modem call or a fax call. Accordingly, 
configuration module 210 configures speech encoder 230 and speech decoder 250 to use a 
5 high quality voice coder, such as G.71 1, which utilizes 64kbps of bandwidth, to ensure that 
modem signals from M2 122 can reach Gl 104 in substantially intact manner. In some 
embodiments, other high quality voice coders may be used, and also speech encoder 230 and 
speech decoder 250 may be part of a single codec. Configuration module 210 may be any 
microprocessor or controller, and the codec may support various speech coding standards, 

10 such as G.726, G.729, G.723.1, G.71 1, etc. 

[1025] Further, at step 306, configuration module 210 may adjust the size of jitter buffer 
240 and may also freeze jitter buffer 240, so its size does not vary dynamically. For example, 
configuration module 210 may set the size of jitter buffer 240 to be smaller than a typical 
jitter buffer size for a voice call. In addition, at step 306, configuration module 210 enables 

15 echo canceller 220. In one embodiment, configuration module 210 may configure various 
components of G2, as described in the U.S. Patent Application Serial No. 09/553,773, filed 
April 21, 2000, entitled "Methods and Apparatus for Data Communication on Packet 
Networks", which describes gateway configuration for accommodating modem 
communication over the packet network. 

20 [1026] Next, at step 308, configuration module 210 enables voice and/or silence detector 
215 to monitor signals on communication line or phone line 118, which are generated by the 
user of phone line 118, such as M2 122, a fax device, a TTY device, a human, etc. In one 
embodiment, voice and/or silence detector 215 may also detect music. It should be noted that 
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voice activity detectors (VADs), and silence and music detectors are well known to those of 
ordinary skill in the art. At step 310, if voice and/or silence detector 215 does not detect 
human voice or silence, for a predetermine period of time, such as five seconds, the process 
moves to step 314, where G2 214 maintains the default configuration, as described above. 
5 However, if voice and/or silence detector 215 detects human voice or silence, the process 
moves to step 312, where configuration module 210 reconfigures G2 214 in a voice mode for 
processing a voice call. The voice call configuration may include configuring speech encoder 
230 and speech decoder 250 for a lower bandwidth voice coder, such as G.723.1, which 
utilizes 5.3 or 6.4kbps of bandwidth. Further, configuration module 210 may increase the 
10 size of jitter buffer 240 for supporting the voice call and may provide for dynamic jitter 
buffer. 

[1027] In one embodiment, the process may move from step 314 to step 316, where for 
some time, G2 214 monitors phone line 118 to determine whether a modem signal is 
received, e.g. from M2 122. For example, as in the conventional approach, G2 214 may 
15 monitor phone line 1 18 for a modem answer tone. If a modem signal is detected at step 316, 
the process moves to step 312. 

[1028] At step 318, once G2 214 determines the call type, G2 214 informs Gl 104 over 
the packet network regarding the same, and Gl 104 will be able to configure its speech 
encoder, speech decoder, jitter buffer, echo canceller, etc. accordingly. For example, if voice 
20 and/or silence detector 215 does not detect human voice or silence, G2 214 may inform Gl 
104 to configure itself for a modem call. In one embodiment, if voice and/or silence detector 
215 does not detect human voice or silence, G2 214 may not transmit any information to Gl 
104, and Gl 104 may remain in its default configuration, as described in step 306 above, 
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which is to support a modem call. However, if voice and/or silence detector 215 detects 
human voice or silence, G2 214 informs Gl 104 that the current call is a voice call, such that 
Gl 104 can configure itself similar to step 312 above for supporting the voice call. 
[1029] One of the advantages of the present invention is that even if human voice or 
5 silence on phone line 118 is not properly detected, the voice call will not be lost or 
terminated, but to the contrary, a higher quality voice communication is provided to the users. 
This is in sharp contrast to the conventional approach, where the gateway device is 
configured to support voice calls and to detect modem signals, and if the modem signals are 
not properly detected, a low quality voice coder, such as G. 723.1 is utilized, which will cause 

1 0 modem call termination and/or significant reduction in modem speed. 

[1030] Referring back to FIG. 3, the process of FIG. 3 equally applies to fax calls, where 
G2 214 receives a call from a fax device on phone line 118. In that event, fax signals, such as 
a fax calling tone, will cause the process to move to steps 314, 316 and 318, where G2 214 
informs Gl 104 of the fax call. As stated above, after step 314, steps 316 and 318 are 

1 5 optional steps and may not exist in some embodiments. 

[1031] The methods and systems presented above may reside in software, hardware, or 
firmware on the device, which can be implemented on a microprocessor, digital signal 
processor, application specific IC, or field programmable gate array ("FPGA"), or any 
combination thereof, without departing from the spirit of the invention. Furthermore, the 

20 present invention may be embodied in other specific forms without departing from its spirit 
or essential characteristics. The described embodiments are to be considered in all respects 
only as illustrative and not restrictive. 
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